On Benchmarking of Invoice Analysis Systems

نویسندگان

  • Bertin Klein
  • Stefan Agne
  • Andreas Dengel
چکیده

An approach is presented to guide the benchmarking of invoice analysis systems, a specific, applied subclass of document analysis systems. The state of the art of benchmarking of document analysis systems is presented, based on the processing levels: Document Page Segmentation, Text Recognition, Document Classification, and Information Extraction. The restriction to invoices enables and requires a more purposeful, i.e. detailed, targetting of the benchmarking procedures (acquisition of ground truth data, system runs, comparison of data, condensation into meaningful numbers). Therefore the processing of invoices is dissected. The involved data structures are elicited and presented. These are provided, being the building blocks of the actual benchmarking of invoice analysis systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition of Invoices from Scanned Documents

In this paper, we describe the work of recognition the first page of an invoice from a set of scanned business documents. This can be applied to document management systems, document analysis systems, pre-processing of information extraction systems. We also present our experiments on Czech and English invoice data set.

متن کامل

Seizing the Treasure: Transferring Layout Knowledge in Invoice Analysis

This paper deals with the transfer of knowledge on invoice document layout and extraction strategies. This knowledge has been automatically generated by self-teaching mechanisms of the invoice analysis software smartFIX over several years of operation. We present results of analyzing this “treasure” of knowledge and putting it to use in smartFIX systems of new users. The evaluation shows that t...

متن کامل

Results of a Study on Invoice-Reading Systems in Germany

Companies order, receive, and pay for goods. Hence they continually receive and process invoices. For the most part these are printed on paper and are dealt with manually, so that each invoice after receipt involves processing costs of about 9 Euro on average. Often, human searching and typing of data into computer forms is required to transfer the information from paper into the computer, e.g....

متن کامل

A Part based Modeling Approach for Invoice Parsing

Automated invoice processing and information extraction has attracted remarkable interest from business and academic circles. Invoice processing is a very critical and costly operation for participation banks because credit authorization process must be linked with the real trade activity via invoices. The classical invoice processing systems first assign the invoices to an invoice class but an...

متن کامل

Benchmarking Sustainability with Respect to Transportation Supply and Demand

This paper is an endeavor to quantify the concept of sustainable transportation. The prevailing idea in the context of sustainable development (SD) emphasizes on the reduction of transportation demand in order to reduce the environmental and social consequences of it. Nevertheless, in the current paper using a measure for SD, and based on the conformity of the growths of all sectors with transp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006